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^ Abstract 

CN In this paper we prove several new stability results for the reconstruction of bi- 

I nary images from two projections. We consider an original image that is uniquely 

determined by its projections and possible reconstructions from slightly different 
projections. We show that for a given difference in the projections, the reconstruc- 
(-^ tion can only be disjoint from the original image if the size of the image is not 

too large. We also prove an upper bound for the size of the image given the er- 
^ ror in the projections and the size of the intersection between the image and the 

I— I reconstruction. 
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5^ Discrete tomograpliy is concerned with problems such as reconstructing binary 

images on a lattice from given projections in lattice directions |6j. Each point 
of a binary image has a value equal to zero or one. The line sum of a line 
through the image is the sum of the values of the points on this line. The 
projection of the image in a certain lattice direction consists of all the line 
sums of the lines through the image in this direction. 



Several problems related to the reconstruction of binary images from two or 
more projections have been described in the literature |6|7] . Already in 1957, 
Ryser gave an algorithm to reconstruct binary images from their horizontal 
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and vertical projections and characterised the set of projections that corre- 
spond to a unique binary image [TT]. For any set of directions, it is possible 
to construct images that are not uniquely determined by their projections in 
those directions ^ Theorem 4.3.1]. The problem of deciding whether an image 
is uniquely determined by its projections and the problem of reconstructing it 
are NP-hard for any set of more than two directions [1]. 

Aside from various interesting theoretical problems, discrete tomography also 
has applications in a wide range of fields. The most important are electron 
microscopy [8] and medical imaging [51IT3] . but there are also applications in 
nuclear science plITO] and various other fields [T2|IT5] . 

An interesting problem in discrete tomography is the stability of reconstruc- 
tions. Even if an image is uniquely determined by its projections, a very small 
error in the projections may lead to a completely different reconstruction [T1I3] . 
Alpers et al. [T|2] showed that in the case of two directions a total error of 
at most 2 in the projections can only cause a small difference in the recon- 
struction. They also proved a lower bound on the error if the reconstruction 
is disjoint from the original image. 

In this paper we improve this bound, and we resolve the open problem of 
stability with a projection error greater than 2. 



2 Notation and statement of the problems 



Let Fi and F2 be two finite subsets of with characteristic functions xi and 
X2- (That is, Xh{x,y) = 1 if and only if {x,y) G Fh, h e {1,2}.) For z G Z, we 
define row i as the set {{x,y) : x = i}. We call i the index of the row. For 
j G Z, we define column j as the set {(x, y) G : ?/ = j}. We call j the index 
of the column. Following matrix notation, we use row numbers that increase 
when going downwards and column numbers that increase when going to the 
right. 

The row sum rf"^ is the number of elements of F^ in row i, that is rf'^^ = 
J2j£zXh{hj)- The column sum c^J^^ of F^ is the number of elements of F^ in 

column j, that is Cj^^ = J2i&z Xh{iij)- We refer to both row and column sums 
as the line sums of Fh- 

Throughout this paper, we assume that Fi is uniquely determined by its row 
and column sums. Such sets were studied by, among others, Ryser [TT] and 
Wang [13]. Let a be the number of rows and h the number of columns that 
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Fig. 1. A uniquely determined set with the assumed row and column ordering, 
contain elements of Fi. We renumber the rows and columns such that we have 

^ > > . . . > r^^^ > 0, 

cS^) > > . . . > c« > 0, 

and such that all elements of F2 are contained in rows and columns with 
positive indices. By [HI Theorem 2.3] we have the following property of Fi 
(see Figure [1]): 

• in row i the elements of Fi are precisely the points 1), (^,2), . . . , (i, Tj-^^), 

• in column j the elements of Fi are precisely the points (1, j), (2, j), . . . , 

We will refer to this property as the triangular shape of Fi. 

Everywhere except in Section [6] we assume that |Fi| = IF2I. Note that we do 
not assume F2 to be uniquely determined. 

As Fi and F2 are different and Fi is uniquely determined by its line sums, F2 
cannot have exactly the same line sums as Fi. Define the difference or error 
in the line sums as 

As in general |t — s| = t + s mod 2, the above expression is congruent to 
E (c^ + ) + E (^i'^ + = + 2IF2I ^ mod 2, 

hence the error in the line sums is always even. We will denote it by 2a, where 
a is a positive integer. 

For notational convenience, we will often write p for \Fi fl F2I. 
We consider two problems concerning stability. 

Problem 1 Suppose Fi n F2 = 0. How large can \Fi\ be in terms of a? 
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Alpers et al. Theorem 29] proved that |-Fi| < o? . They also showed that 
there is no constant c such that < ca for all F\ and ¥2- In Section |4] of 
this paper we will prove the new bound |Fi | < a(l + log a) and show that this 
bound is asymptotically sharp. 

Problem 2 How small can |Fin-F2| be in terms o/|Fi| and a, or, equivalently, 
how large can \Fi\ be in terms 0/ |-Fi fl F2I and a? 

Alpers ([H Theorem 5.1.18]) showed in the case a = 1 that 

\Fi n F2I > + ^ - ^j2\Fi\ + I 

This bound is sharp: if = ^n{n + 1) for some positive integer n, then 
there exists an example for which equality holds. A similar result is stated in 
[21 Theorem 19]. 

While only deal with the case a = 1, we will give stability results for 
general a. In Section |5] we will give two different upper bounds for The 
bounds have different asymptotic behaviour. Writing p for |FinF2|, the second 
bound reduces to 

<p + l + ^2p + l 
in case a = 1, which is equivalent to 

p> \Fi\ - ^2\F\\. 

Hence the second new bound can be viewed as a generalisation of Alpers' 
bound. The first new bound is different and better in the case that a is very 
large. 

In Section |6] we will generalise the results to the case 7^ IF2I. 



3 Staircases 

Alpers introduced the notion of a staircase to characterise Fi A F2 in the case 
a = 1. We will use a slightly different definition and then show that for general 
a the symmetric difference Fi A F2 consists of a staircases. 

Definition 3 A set of points {pi,P2, ■ ■ ■ ,Pn) in 'S? is called a staircase ij the 
following two conditions are satisfied: 

• for each i with 1 < i < n — 1 one of the points pi and pi^i is an element of 
Fi\F2 and the other is an element of F2\Fi; 
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Fig. 2. A staircase. The set Fi consists of the white and the black-and-white points, 
while F2 consists of the black and the black-and-white points. The staircase is 
indicated by the dashed line segments. 

• either for all i the points p2i O'nd P2i+i ore in the same column and the points 
P2i+i and p2i+2 (11"^ the same row, or for all i the points p2i and P2i+i are 
in the same row and the points P2i+i and p2i+2 ore in the same column. 

This definition is different from in the following way. Firstly, the number 
of points does not need to be even. Secondly, the points pi and Pn can both 
be either in Fi\F2 or in F2\Fi. So this definition is slightly more general than 
the one used in p!p] for the case a = 1. 

Consider a point pi G Fi\F2 of a staircase {pi,p2, . . . ,Pn)- Assume pi-i is 
in the same column as pi and pi+i is in the same row as pi. Because of the 
triangular shape of Fi, the row index of must be larger than the row index 
of Pi, and the column index of p^+i must be larger than the column index of Pi. 
Therefore, the staircase looks like a real- world staircase (see Figure [2]). From 
now on, we assume for all staircases that pi is the point with the largest row 
index and the smallest column index, while p„ is the point with the smallest 
row index and the largest column index. We say that the staircase begins with 
Pi and ends with pn- 

Lemma 4 Let Fi and F2 he finite subsets of 1? such that 

• Fi is uniquely determined by its row and column sums, and 

• l^il = 1^2! • 

Let a be defined as in Section\^ Then the set Fi A F2 is the disjoint union of 
a staircases. 



PROOF. We will construct the staircases one by one and delete them from 
Fi A F2. For a subset A of Fi A F2, define 



Pi{A) = \{jeZ: (^, j) e An F,}\ - \{j e z : e An F2}\, lez, 

aj{A) = \{teZ:{i,3)eAnF^}\-\{ieZ:{i,j)eAn F2}|, J G Z, 
r(^) = ElP^(^)l+E 1^.(^)1- 

i 3 

We have 2a = t{F^ A F2). 
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Assume that the rows and columns are ordered as in Section |2l Because of 
the triangular shape of Fi, for any point G Fi\F2 and any point {k, I) G 
F2\Fi we then have k > i or I > j. 

Suppose we have deleted some staircases and are now left with a non-empty 
subset A of Fi A F2. Let {pi,P2, ■ ■ ■ ,Pn) be a staircase of maximal length that 
is contained in A. Let {xi,yi) and {xn,yn) be the coordinates of the points pi 
and Pn respectively. Each of those two points can be either in A (1 Fi or in 
An F2, so there are four different cases. (If n = 1, so pi and p„ are the same 
point, then there are only two cases.) We consider two cases; the other two 
are similar. 

First suppose pi G A fl Fi and p„ G A fl F2. If (x, yi) is a point of An F2 in 
the same column as pi, then x > xi, so we can extend the staircase by adding 
this point. That contradicts the maximal length of the staircase. So there are 
no points of An F2 in column yi. Therefore (Tyj^{A) > 0. 

Similarly, since p„ G A fl -F2 , there are no points of AdFi in the same column 
as Pn- Therefore <Jy„{A) < 0. 

All rows and all columns that contain points of the staircase, except columns 
yi and yn, contain exactly two points of the staircase, one in A fl Fi and 
one in A n F2. Let A' = A\{pi,p2, . ■ . ,Pn}- Then Pi{A') = Pi{A) for all i, 
and cTjiA') = (7j{A) for all j 7^ yi.Vn- Furthermore, ay^{A) = (Jy^iA) — 1 
and ay^{A') = cry„{A) + 1. Since o-y-^{A) > and o-y^{A) < 0, this gives 
t{A') = t{A) - 2. 

Now consider the case pi E A n Fi and p„ G A n Fi. As above, we have 
(Jyj^iA) > 0. Suppose {xn, y) is a point of An F2 in the same row as p„. Then 
y > yn, so we can extend the staircase by adding this point. That contradicts 
the maximal length of the staircase. So there are no points of An F2 in row 
Xn- Therefore Px„{A) > 0. 

All rows and all columns that contain points of the staircase, except column 
yi and row x„, contain exactly two points of the staircase, one in A n Fi 
and one in A fl F2. Let A' = A\{pi,p2, . . . ,Pn}- Then Pi{A') = Pi{A) for all 
i 7^ Xn, and crj{A') = crj{A) for all j ^ y\. Furthermore, 0"y^(y4') = Oy^i^A) — 1 
and Px„(^') = PxS^ - 1- Since OyJy^A) > and Px^A) > 0, this gives 
t{A) = t{A) - 2. 

We can continue deleting staircases in this way until all points of Fi A F2 have 
been deleted. Since t{A) > for all subsets A <Z Fi A F2, this must happen 
after deleting exactly a staircases. □ 

Remark 5 Some remarks about the above lemma and its proof. 
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(i) The a staircases from the previous lemma have 2a endpoints in total (where 
we count the same point twice in case of a staircase consisting of one point). 
Each endpoint contributes a difference of 1 to the line sums in one row or 
column. Since all these differences must add up to 2a, they cannot cancel 
each other. 

(a) A staircase consisting of more than one point can be split into two or more 
staircases. So it may be possible to write Fi A F2 as the disjoint union of 
more than a staircases. However, in that case some of the contributions of 
the endpoints of staircases to the difference in the line sums cancel each 
other. On the other hand, it is impossible to decompose Fi A F2 into fewer 
than a staircases. 

(Hi) The endpoints of a staircase can be in Fi\F2 or F2\Fi. For a staircase T of 
which the two endpoints are in different sets, we have |T fl Fi| = |r fl F2I. 
For a staircase T of which the two endpoints are in the same set, we have 
\TnFi\ = 1 + |TnF2| or iTnFa] = 1 + |TnFi|. Smce \F1\F2\ = \F2\F1\, 
the number of staircases with two endpoints in Fi\F2 must be equal to the 
number of staircases with two endpoints in F2\Fi. This implies that of the 
2a endpoints, exactly a are in the set Fi\F2 and a are in the set F2\Fi. 



Consider a decomposition of Fi A F2 as in the proof of Lemma |4j We will now 
show that for our purposes we may assume that all these staircases begin with 
a point pi e Fi\F2 and end with a point pn G F2\Fi. 



Suppose there is a staircase beginning with a point (x, y) G F2\Fi. Then there 
also exists a staircase ending with a point {x',y') G Fi\F2: otherwise more 
than half of the 2a endpoints would be in F2\Fi, which is a contradiction to 
Remark 5Iiii). Because of Remark 5li) we must have rl^^ < rl^^ and ^^^^ ^ ""^^^ 
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Because of Remark 5|^i 
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Let y" be such that {x',y") ^ Fi U F2. Delete the point {x,y) from F2 and 
add the point {x',y") to F2. Then r^^ decreases by 1 and r^?-* increases by 1, 
so the difference in the row sums decreases by 2. Meanwhile, the difference in 
the column sums increases by at most 2. So a does not increase, while Fi, \F2\ 
and |Fi A F2I do not change. So the new situation is just as good or better 
than the old one. The staircase that began with (x, y) in the old situation 
now begins with a point of Fi\F2. The point that we added becomes the new 
endpoint of the staircase that previously ended with {x',y'). 



Therefore, in our investigations we may assume that all staircases begin with a 
point of Fi\F2 and end with a point of F2\Fi. This is an important assumption 
that we will use in the proofs throughout the paper. An immediate consequence 
of it is that r^-^^ = r^"^^ for all i. The only difference between corresponding line 
sums occurs in the columns. 
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4 A new bound for the disjoint case 

Using the concept of staircases, we can prove a new bound for Problem [T] 
Theorem 6 Let Fi and F2 be finite subsets of 1? such that 

• Fi is uniquely determined by its row and column sums, 

• = IF2I, and 

• Fi n F2 = 0. 

Let a be defined as in Section^ Then 

a 

l^il<E 

i=l 

PROOF. Assume that the rows and columns are ordered as in Section [U Let 
a be the number of rows and h the number of columns that contain elements of 
-Fi. Let (/c, /) G -Fi. Then all the points in the rectangle {(i, j) : 1 < i < A;, 1 < 
j < 1} are elements of Fi. Since Fi and F2 are disjoint, none of the points in 
this rectangle is an element of F2, and all the points belong to Fi A F2. So all 
of the kl points must belong to different staircases, which implies a > kl. For 
all i with 1 < i < a we have (i,r|^^) G Fi, hence r^^^ < Since r^^^ must be 
an integer, we have 

a a 
i=l i=l 

Since (a, 1) G Fi, we have a < a, so 

a 

|i^il<E 

i=l 

□ 

Corollary 7 Let Fi, F2 and a be defined as in Theorem^ Then 

\Fi\ < a{l + log a). 



PROOF. We have 



"1 / p 1 \ 
< - < a (l + / —dx J = a (1 + log a) . 

□ 






\Fi\<i: 
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Fig. 3. The construction from Example [S] with in = 3. 

The following example shows that the upper bound cannot even be improved 
by a factor 2\lg2 ~ 0.72. 

Example 8 (taken from JIV) 

Let m > 1 be an integer. We construct sets Fi and F2 as follows (see also 
Figure |3]) . 

• Row 1: 

■ eFi for 1 <j <2-, 

■ (1, j) G F2 for 2" + 1 < j < 2"^+^ 

• Let < / < m - 1. Row i, where 2' + 1 < i < 2'+^: 

■ (ij) G Fi for 1 < j < 2"^-'-^ 

• (i, j) G F2 for 2"^-'-! + 1 < j < 2™"'. 

The construction is almost completely symmetrical: if G Fi, then G 
Fi; and if G -F2 with i > 1, then G -F2. Since it is clear from the 

construction that each row contains exactly as many points of Fi as points 
of F2, we conclude that each column j with 2 < j < 2™ contains exactly as 
many points of Fi as points of F2 as well. The only difference in the line sums 
occurs in the first column (which has 2'" points of Fi and none of F2) and in 
columns 2'" + 1 up to 2"^~^^ (each of which contains one point of F2 and none 
of Fi). So we have 

a = 2™. 

Furthermore, 

m—1 

= 2™ + ^ 2'2™-'-^ = 2"^ + m2'^-\ 
1=0 

Hence for this family of examples it holds that 



\Fi\ = a + -alog2a, 
which is very close to the bound we proved in Corollary [7j 



9 



5 Two bounds for general a 

In case F\ and F2 are not disjoint, we can use an approach very similar to 
Section |4] in order to derive a bound for Problem [2l 

Theorem 9 Let F\ and F2 be finite subsets of 1? such that 

• Fi is uniquely determined by its row and column sums, and 

• l^il = 1^2! • 

Let a be defined as in Section^ and let p = |Fi fl F2I. Then 

a+p 

\Fi\ < E 

1=1 

PROOF. Assume that the rows and columns are ordered as in Section [2l Let 
{k,l) G -Fi. Then all the points in the rectangle {{i,j) '■ 1 < i < k,l < j < 1} 
are elements of Fi. At most p of the points in this rectangle are elements of 
F2, so at least kl — p points belong to Fi A F2. None of the points in the 
rectangle is an element of F2\Fi, so all of the kl — p points of Fi A F2 in the 
rectangle must belong to different staircases, which implies a+p > kl. For all 
i with 1 < i < a we have {i,rl^^) G Fi, hence r^^^ < Since r^^^ must be 
an integer, we have 

a a 
1=1 i=l 

Since (a, 1) G Fi, we have a < a + p, so 

a+p 

\Fi\ < E 

i=l 

□ 

Corollary 10 Let Fi, F2, a and p be defined as in Theorem^ Then 

\Fi\ < (a + p)(l + log(a + p)). 

PROOF. Analogous to the proof of Corollary [7j □ 

The following example shows that the upper bound cannot even be improved 
by a factor ^ ^ 0.72, provided that a > 2iog2^-i log(p + !)• 

Example 11 



a+p 
i 



a + p 
i 



a+p 
i 
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Fig. 4. The construction from Example [TT] with A; = 3 and m = 4. 



Let k and m be integers satisfying k > 2 and m > 2fc — 2. We construct sets 
Fi and F2 as follows (see also Figures [i] and [s]) . 

• Row 1: 

■ (1, j) e Fi n F2 for 1 < J < 2^-\ 

■ (1, j) e Fi for 2'=-! + 1 < j < 2™ - 2'="! + 1, 

■ (1, j) e F2 for 2"^ - 2^-1 + 2 < J < 2"*+! - 2'= - 2^"^ + 2. 

• Let < / < A; - 2. Row i, where 2' + 1 < z < 2'+^: 

■ {t, 1) G Fi n F2, 

■ e Fi for 2 < j < 2™-'-i - 2'=-'-2 ^ 

■ (ij) G F2 for 2™-'^i - 2'=-'-2 + 2 < j < 2™-' - 2''-^-^ + 1. 

• Let A; - 1 < / < m - A;. Row i, where 2' + 1 < i < 2'+^: 

■ (2, j) G Fi for 1 < j < 2'"-'~\ 

■ (z, j) G F2 for 2™-'-i + 1 < J < 2™-'. 

• Let m - A; + 1 < / < m - 1. Row where 2' - 2'-™+'=-^ + 2 < z < 

■ (ij) G Fi for 1 < j < 2™-'-\ 

■ k j) e F2 for 2™-'-i + 1 < J < 2"-^ 

The construction is almost symmetrical: if G Fi, then (j, G Fi; if 

(i, j) G Fi n F2, then {j,i) G Fi n F2; and if G F2 with i > 1, then 

(j, i) G F2. Since it is clear from the construction that each row contains 
exactly as many points of Fi as points of F2, we conclude that each column j 
with 2 < j < 2™ — 2^^"^ + 1 contains exactly as many points of Fi as points 
of F2 as well. The only difference in the line sums occurs in the first column 
(which has 2"^ — 2*^"^ + 1 points of Fi and only 2^~^ of F2) and in columns 
2m _ 2^-1 + 2 up to 2™+^ - 2^" - 2^""^ + 2 (each of which contains one point of 
F2 and none of Fi). So we have 
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Fig. 5. The construction from Example |11| with k = 2 and m = 4. 

a = ^ ((2'" - 2'^-^ + 1) - 2''-^ + (2™+^ - 2^ - 2''-^ + 2) - (2™ - 2^=-^ + 1)) 
= 2™ - 2^= + 1. 

It is easy to see that 

Now we count the number of elements of Fi. 

• Row 1 contains 2"^ — 2^^^ + 1 elements of Fi. 

• Let < I < k - 2. Rows 2' + 1 up to 2'+^ together contain 2^(2'^-^-^ - 
2fc-z-2 + 1) = 2"-i - 2^-2 ^ 2' elements of Fi. 

• Let k — l<l<m — k. Rows 2' + 1 up to 2'+^ together contain 2' ■ 2™~'~^ = 
2m-i gig]2ients of Fi. 

• Letm-k + l<l<m-l. Rows 2' - 2'-™+'=-i + 2 up to 2^+^ - 2'"™+'= + 1 
together contain (2' - 2'-"+'=-i)(2™-'-i) = 2"-^ - 2'=^^ elements of Fi. 

Hence the number of elements of Fi is 



k~2 

\Fi\ = 2" - 2^'^ + l + {k- l)(2™-i - 2^-2) + ^ 2' 

z=o 

+ (m - 2A; + 2)2"^-^ + {k - l)(2™-i - 2'=-^) 
= 2"^ + m2™-i + 2^=-^ - /t2^-\ 

For this family of examples we now have 

|Fi| = a + pH — log2(a +p) H — log2(p+ 1). 
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We will now prove another bound, which is better if j9 = \Fi fl -F2I is large 
compared to a. Let u be an integer such that 2m = |Fi A F2I. We will first 
derive an upper bound on u in terms of a, b and a. Then we will derive a 
lower bound on |Fi| in terms of a, b and a. By combining these two, we find 
an upper bound on u in terms of a and p. 

Lemma 12 Let Fi and F2 he finite subsets of 7? such that 

• Fl is uniquely determined by its row and column sums, and 

• l^il = 1^2! • 

Let a, a and b be defined as in Section^ Define u as 2u = \Fi A F2I. Then 
we have 

< -{a + b){a + b + a - I). 



PROOF. Decompose Fi A F2 into a staircases as in Lemma |4| and let T be 
the set consisting of these staircases. Let T G T be a staircase and i < a + 1 
a positive integer. Consider the elements of T fl F2 in rows i, i + 1, . . . , a. If 
such elements exist, then let Wi(T) be the largest column index that occurs 
among these elements. If there are no elements of TnF2 in those rows, then let 
Wi{T) be equal to the smallest column index of an element of TflFi (no longer 
restricted to rows i, . . . , a). We have Wi{T) > 1. Define Wi = J^rer'^iiT). 

Let di be the number of elements of Fi\F2 in row i. Let yi < . . . < yd^ he the 
column indices of the elements of Fi\F2 in row i, and let y[ < . . . < y'^, he 
the column indices of the elements of F2\Fi in row i. Let 7^ C T be the set 
of staircases with elements in row i. The elements in F2\Fi of these staircases 
are in columns y[, y'2, ■ ■ ■ , y'^., hence the set {wi{T) : T e %} is equal to 
the set {y[, y'2, . . . , y'dj- The elements in Fi\F2 are in columns yi, y2, ■ ■ ■ , yd 
and are either the first element of a staircase or correspond to an element of 
F2\Fi in the same column but in a row with index at least i + 1. In either 
case, for a staircase T E % we have Wj+i(T) = yj for some j. Hence the set 
{wi+i(T) : T G %} is equal to the set {yi,y2, ■ ■ ■ ,ydi}- We have 

di di 

^ Wi+i{T) = J2yj< Jliyd.. - J + 1) = diyd, -M- ^)dh 
TeTi j=i j=i ^ 

and 

di di 

Wi{T) = J2yj^ J2(y<i^ +j) = ^iVd, + ^{di + 1)^^. 

Hence 



13 



> Wi+^ + -{di + l)di + ^{di- l)di 
^Wi+i + dl 

Since Wa+i > a, we find 

Wi>a + dl + --- + dl. 

We may assume that if {x, y) is the endpoint of a staircase, then {x, y') is an 
element of F1UF2 for 1 < y' < y (i.e. there are no gaps between the endpoints 
and other elements of Fi U F2 on the same row). After all, by moving the 
endpoint of a staircase to another empty position on the same row, the error 
in the columns can only become smaller (if the new position of the endpoint 
happens to be in the same column as the first point of another staircase, in 
which case the two staircases fuse together to one) but not larger, and u, a 
and b do not change. 

So on the other hand, as Wi is the sum of the column indices of the endpoints 
of the staircases, we have 

Wi<{b+l) + {b + 2) + --- + {b + a)^ab+ la{a + 1). 
We conclude 

a Y 

a + ^ < a6 + -ol{ol + 1). 
Note that di — u. By the Cauchy-Schwarz inequality, we have 



so 



From this it follows that 



\i=\ 



i=l 



1 V? 

ab + -a{a + 1) > a H , 

Zi Qj 

or, equivalently, 

< aab + -a{a — l)a. 

By symmetry we also have 

V? < aab + -a{a — 1)6. 

Zi 
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Hence 



< aab + -a{a — l)(a + b). 



Using that y/ab < we find 



<a \ ^ ^ ' + ^ ^ \ = -{a + b){a + b + a-l). 



Lemma 13 Let Fi and F2 he finite subsets of l? such that 

• Fi is uniquely determined by its row and column sums, and 

• l^il = 1^2! . 

Let a, a and b be defined as in Section\^ Then we have 

' ' - 4(a + l) 



□ 



PROOF. Without loss of generality, we may assume that all rows and columns 
that contain elements of Fi also contain at least one point F1AF2: if a row or 
column does not contain any points of Fi A F2, we may delete it. By doing so, 
Fi AF2 does not change, while becomes smaller, so the situation becomes 
better. 

First consider the case rl^^ < rl^"* — a for some i. We will show that this is 
impossible. If a column does not contain an element of F2\Fi, then by the 
assumption above it contains an element of Fi\F2, which must then be the 
first point of a staircase. Consider all points of F2\Fi and all first points of 
staircases in columns rj+i + 1, rj+i + 2, . . . , r^. Since these are more than a 
columns, at least two of those points must belong to the same staircase. On the 
other hand, if (x, y) G Fi\F2 is the first point of a staircase with rj+i < y < ri, 
then we have x < z, so the second point {x',y') in the staircase, which is in 
F2\Fi, must satisfy x' < i and therefore y' > r^. So the second point cannot 
also be in one of the columns rj+i + 1, rj+i + 2, . . . , r^. If two points of F2\Fi 
in columns r.j+i + 1, rj+i + 2, . . . , r j belong to the same staircase, then they 
must be connected by a point of Fi\F2 in the same columns. However, by 
a similar argument this forces the next point to be outside the mentioned 
columns, while we assumed that it was in those columns. We conclude that it 
is impossible for row sums of two consecutive rows to differ by more than a. 

By the same argument, column sums of two consecutive columns cannot differ 
by more than a. Hence we have r|^\ > r|^'' — a for all i, and c^-^^i > c^-^'' — a 
for all j. 
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Fig. 6. The number of points of Fi (indicated by small black dots) is equal to the 
grey area. 

We now have r^2^ > b — a, > b — 2a, and so on. Also, c^2^ > a — a, 
Cs^^ > a — 2a, and so on. Using this, we can derive a lower bound on 
for fixed a and b. Consider Figure [6] The points of Fi are indicated by black 
dots. The number of points is equal to the grey area in the picture, which 
consists of all 1 x 1-squares with a point of Fi in the upper left corner. We 
can estimate this area from below by drawing a line with slope a through 
the point (a + 1, 1) and a line with slope ^ through the point (6 + 1, 1); the 
area closed in by these two lines and the two axes is less than or equal to the 
number of points of Fi. 

For a = 1 those lines do not have a point of intersection. Under the assumption 
we made at the beginning of this proof, we must in this case have a = b and 
the number of points of Fi is equal to 

a{a + 1) ^ {(^ + 



2 - a + l 4(a + l)' 
so in this case we are done. 

In order to compute the area for a > 2 we switch to the usual coordinates in 
M^, see Figure [7| The equation of the first line is y = ax — a, and the equation 
of the second line is y = -x — -b. We find that the point of intersection is 
given by 

( aa — b —ba + a\ 

{x,y) = —2 — f ' ^ — r • 

\a^ — 1 a^ — 1 J 
The area of the grey part of Figure [7] is equal to 

1 aa — b 1 ba — a a^a + 6^a — 2ab 
2^' a^-1 ^ 2 ■ a2-l " 2(a2 - 1) ' 

We now have 

a{a^ + b^)-2ab ^ _ (g±^ _ (a + 5)2 

' 2(a2-l) - 2(a2-l) ~4(a + l)' 

□ 

Theorem 14 Let Fi and F2 be finite subsets ojl? such that 
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(0, -a) 



Fig. 7. Computing the area bounded by the two Unes and the two axes. 

Fi is uniquely determined by its row and column sums, and 
\Fi\ = I-F2I. 



Let a be defined as in Section\^ and let p = |Fi fl F2I. Write (3 = ^/a{a + 1). 
Then 



'a 



a — I 



\F^\<p+d-i^(3+^(3{a-l)+4{a + l)p + l3^+ ^ 



[a — l^a 
16 ■ 



PROOF. Write s = a + b for convenience of notation. From Lemma [12 
derive 

/ a — 1 



we 



"^Y ^+ 2 



We substitute = u + p in Lemma 13 and use the above bound for u: 

2 V 2 y ^ - ' " - 4(a + 1) 
Solving for s, we find 



s < + 1) + y ^foi[oL^ - 1) + 4(a + \)p + a(a + 1)^ 

= /? + + 1) + 4(a + \)p + 

Finally we substitute this in Lemma [T2j 



'a 



a — 1 



- V 4 r + V/^(« - 1) + 4(a + l)p + /32 + ^ 



(a — l)2a 
16 ■ 



This, together with = -u + p, yields the claimed result. □ 
Remark 15 By a straightforward generalisation of /H Proposition 13 and 



Lemma 16], we find a bound very similar to the one in Theorem I4 



l^ . s / (2a - 1)2 
\F^\<p+{a + l){a--) + {a + l)J2p + 
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Fig. 8. The construction from Example |16| with = 4 and a = 3. 



Theorem 
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says that |Fi| is asymptotically bounded by p + a.^ + o? . The 
next example shows that can be asymptotically as large as p^2Joq)^a. 



Example 16 



Let be a positive integer. We construct Fx and F2 with total difference in 
the line sums equal to 2a as follows (see also Figure [s]). Let (i,j) G Fi fl F2 
for 1 < ^ < A^, 1 < j < A^. Furthermore, for 1 < i < A^: 



Let (i, j), (j, i) e Fi n F2 for A^ + 1 < j < A^ + (A^ - i)a. 

Let (i, j), (j,i) e Fi for A^ + (A^ - i)a + 1 < J < A^ + (A^ - 2 + l)a. 

Let (i, j), (j,i) G F2 for A^ + (A^ - i + l)a + 1 < J < A^ + (A^ - i + 2)a. 



Finally, for 1 < t < a, let (z, j) G F2 with i = N + t and j = A^ + a + l-t. 



The only differences in the line sums occur in the first column (a difference of 
a) and in columns A^ + A^a + 1 up to A^ + A^a + a (a difference of 1 in each 
column). We have 

p = N^ + 2- ]^N{N - l)a = A^2 + A^^a - A^a, 

and 

|Fi| = AT^ + 2 ■ -N{N + l)a = A^^ + A^^a + A^a. 
2 
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From the first equality we derive 



N 



a 



+ 



+ 



2(a + l) \a + l 4(a + l)2 



Hence 



a 



\Fi\=p + 2Na = p H + 



a + 1 \a + l (a + 1)^ 



6 Generalisation to unequal sizes 



Until now, we have assumed that = |F2|. However, we can easily generalise 
all the results to the case |Fi| ^ \F2\. 

Suppose l-Fil > IF2I. Then there must be a row i with r|^^ > rl:'^\ Let j > b 
be such that ^ F2 and define F3 = F2 U {(z, j)}. We have r\ = r- + 1, 
so the error in row i has decreased by one, while the error in column j has 
increased by one. In this way, we can keep adding points until F2 together 
with the extra points is just as large as Fi, while the total difference in the 
line sums is still 2a. Note that p = \F1r\F2\ and |-Fi| have not changed during 
this process, so the results from Theorem 14 and Corollary [To] are still valid 
in exactly the same form. 

Suppose on the other hand that |-Fi| < IF2I. Then there must be a row with 
rf''^ < rf\ Let j be such that G F2\Fi and define F3 = F2\{{i, j)}. The 
error in row i has now decreased by one, while the error in column j has at 
most increased by one, so the total error in the line sums has not increased. 
We can keep deleting points of F2 until there are exactly points left, while 
the total difference in the line sums is at most 2a. 



By using \Fi A F2I = 2(|Fi| — p), we can state the results from Theorem 14 



and Corollary 10 in a more symmetric way, not depending on the size of Fi. 



Theorem 17 Let Fi and F2 be finite subsets of 1? such that Fi is uniquely 
determined by its row and column sums. Let a be defined as in Section\^ and 
let p = |Fi n F2I. Write p = y^{a + 1). Then 

(1) |Fi A F2I < 2 a + 2{a + p) \og{a + p). 

(2) |Fi A F2I < \ja [l3 + 0% - 1) + 4(a + l)p + /^^ + ^^^)' - I^^^. 
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